DNA Amplification

FIELD OF THE INVENTION

The present invention relates to a method of amplifying in
vivo a DNA sequence present in a genome of a cell, a cell
harbouring multiple copies of said amplified DNA sequence in
a genome, and a vector harbouring a DNA construct to be used
in the method. Furthermore, the present invention relates to
a method of producing a polypeptide by culturing a cell as
described above.

BACKGROUND OF THE INVENTION 

A large number of naturally-occurring organisms have been
found to produce useful products, the large-scale production
of which is desirable for research and commercial purposes.
Once such a product has been identified, efforts are made
to develop production methods giving a high yield of the
product. One widely used method, which is based on
recombinant DNA techniques, is to clone a gene encoding the
product, insert the gene into a suitable expression system
permitting the expression of the product, and culture a
suitable host cell comprising the expression system, either
integrated in the chromosome or as an extrachromosomal
entity, under conditions conducive to the expression of the
product. However, a prerequisite for using such a method is
that the gene in question can be identified and cloned, and
further that a suitable expression system and host cell for
the production are available.

Another approach which may be used for the production of such
products is to culture the cell which in nature produces the
product, or a derivative of such a cell, under suitable
conditions. However, a frequently recognized drawback of such
a method is that the cell is not a suitable production
organism, one reason being that the amount of product
produced by such a cell is too low to be commercially
attractive.

Irrespective of which production method is used, it is
normally desirable to be able to increase the production
level of a given protein. Thus, efforts are made to increase
the production, e.g. by inserting the gene encoding the
product under the control of a strong expression signal, or
by increasing the number of copies of the gene in the
production organism in question. This latter approach may be
accomplished by inserting the gene into a multicopy plasmid,
which generally, however, tends to be unstable in the host
cell in question, or by integrating multiple copies of the
gene into the chromosome of the production organism, an
approach which generally is considered more attractive
because the stability of the construct tends to be higher,
allowing the gene to be stably maintained in the production
organism.

EP 0 284 126 and EP 166 628 disclose methods for stably
integrating one or more copies of a gene into the chromosome
of a prokaryotic cell already harbouring at least one copy of
the gene in question in its chromosome. According to EP 0 284
126, a host cell comprising said gene is transformed with a
DNA construct comprising another copy of the gene, whereby,
after a suitable selection procedure, a cell is obtained
which in its chromosome comprises two copies of the gene
separated by an endogenous chromosomal sequence which is
vital to the host cell and thereby ensures stable maintenance
of the integrated gene. This procedure may be repeated so as
to produce cells harbouring multiple copies of the gene in
their chromosome.

EP 166 628 relates to a process for amplifying a specific
gene in the chromosome of a Bacillus strain, thereby
obtaining a cell harbouring a so-called "amplifiable unit"
comprising the gene, the expression elements of the gene, and
a gene encoding a selection marker inserted between two
directly repeating sequences termed "duplicated sequences".
The gene is introduced into the cell by a plasmid integration
vector which is integrated in the Bacillus chromosome and
which harbours a marker gene, the gene to be amplified, and
one of the duplicated sequences, the other being present on
the chromosome of the Bacillus cell.

Both of the above described methods require that the entire
gene to be amplified is insertable into the vector to be used
in the amplification method, and thus are applicable only
when the gene to be amplified has been isolated and is
available on a vector to be used in the method.

BRIEF DISCLOSURE OF THE INVENTION 

The present invention relates to a generally novel method of
amplifying a DNA sequence present in a genome of a cell,
which, as compared to the above described methods, has the
advantage that there is no requirement that the DNA sequence
to be amplified is available in its entirety.

More specifically, in a first aspect the present invention
relates to a method of amplifying in vivo a DNA sequence B
present in a genome of a parent cell, which method comprises

a) integrating in a genome of said cell a DNA construct
comprising the structure C-M-A-D, in which

A denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B,

C denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B, the
sequence C being located in the opposite end of the sequence
B as compared to A,

D denotes a DNA sequence which is homologous with a genomic
DNA fragment located distal for C as compared to B, and

M denotes a DNA sequence encoding a selection marker,

b) selecting for cells in which the DNA sequence M has been
integrated in the genome either upstream or downstream of the
DNA sequence B together with the sequence A, which cells
comprise, in any orientation, the structure A-B-C-M-A-D, and

c) propagating the cells selected in step b) under increasing
selection pressure for the selection marker encoded by the
DNA sequence M so as to obtain a cell which has obtained an
increased number of genomic integrated copies of the DNA
sequences B and M as compared to the parent cell.

Integration of the DNA construct comprising the structure
C-M-A-D into a genome of the parent cell results in a genomic
structure in which the DNA sequence B together with a
suitable selectable marker is located between two directly
repeated DNA sequences A, one of which originates from the
genome in question and one from the DNA construct. When a
strain comprising such a structure is propagated under
increasing selection pressure for the marker, the culture is
enriched for cells containing duplications, triplications,
and higher amplifications of the genes between the two
directly repeated sequences. Thus, it is contemplated that
the number of copies of the DNA sequence of interest may be
20, 50, 100 or more, the upper limit being the number of
copies which becomes too heavy a burden for the cell. By use
of the method of the invention it has been found that the
amplified DNA is quite stable in the absence of selection
for the marker M.
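
The amplification described above can be pictured with a toy
model. The following Python sketch is illustrative only and
is not part of the disclosed method: it assumes that a genome
can be represented as a list of segment labels and that each
amplification event is a recombination between the first two
directly repeated copies of A which adds one further copy of
the unit lying between them. The function names are
hypothetical, and the model says nothing about how often such
events occur or how selection pressure acts on them.

```python
def duplicate_unit(genome, repeat="A"):
    """Model one recombination between two direct repeats.

    The unit is the first copy of the repeat plus everything up to
    (but not including) the next copy; one extra unit is added.
    """
    first = genome.index(repeat)
    second = genome.index(repeat, first + 1)  # raises ValueError if absent
    unit = genome[first:second]
    return genome[:first] + unit + genome[first:]


def amplify(genome, rounds, repeat="A"):
    """Apply the duplication model a given number of times."""
    for _ in range(rounds):
        genome = duplicate_unit(genome, repeat)
    return genome


if __name__ == "__main__":
    start = ["A", "B", "C", "M", "A", "D"]  # structure after integration
    amplified = amplify(start, rounds=3)
    print("-".join(amplified))
    print("copies of B:", amplified.count("B"))
    print("copies of M:", amplified.count("M"))
```

In this model B and M are always amplified together, which is
why selection for the marker M can be used to enrich for
cells carrying additional copies of B.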


It will be understood that the amplification method of the
invention has the important advantage over the prior art
methods that the entire DNA sequence to be amplified does not
need to be available for the method to be carried out. Only a
part of the DNA sequence or its flanking regions need to be
known. This is an advantage in that, although DNA isolation
and sequencing methods have been substantially improved
during the last decade, it is still laborious to isolate and
sequence a DNA sequence of interest and, in fact, not always
possible.

In the present context, the term "genome" is normally
intended to indicate a chromosome of the parent cell.
However, the term is also intended to indicate any other
genome present in the parent cell, an example of which is a
plasmid, for instance a large stable plasmid present in the
cell.

The term "homologous" as used about the DNA sequences A, C
and D is intended to indicate a degree of identity between
any of these sequences and the corresponding parts of the
genome which is sufficient for homologous recombination to
take place. Preferably, the DNA sequences show identity or
substantial identity for at least 8 consecutive base pairs
with the corresponding parts of the genome. However, the DNA
sequences may be longer, e.g. comprising up to several
thousand nucleotides.
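
As an illustration of the preferred minimum of 8 consecutive
identical base pairs, the following Python sketch finds the
longest stretch of exact identity between a candidate sequence
and a genomic sequence. It is a minimal sketch under stated
assumptions: the function names and example sequences are
hypothetical, only exact identity on a single strand is
checked ("substantial identity" and the reverse complement
are not modelled), and passing the check does not by itself
show that recombination will occur.

```python
def longest_identical_stretch(arm: str, genome: str) -> int:
    """Length of the longest run of consecutive identical bases
    shared by the two sequences (longest common substring)."""
    arm, genome = arm.upper(), genome.upper()
    best = 0
    prev = [0] * (len(genome) + 1)
    for i in range(1, len(arm) + 1):
        cur = [0] * (len(genome) + 1)
        for j in range(1, len(genome) + 1):
            if arm[i - 1] == genome[j - 1]:
                # extend the identical run ending at the previous bases
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def meets_minimum_identity(arm: str, genome: str, min_bp: int = 8) -> bool:
    """True if the sequences share at least min_bp consecutive bases."""
    return longest_identical_stretch(arm, genome) >= min_bp


if __name__ == "__main__":
    genome = "TTGACAGCTAGCTCAGTCCTAGG"   # made-up example sequence
    arm = "GGGCTAGCTCAGAAA"              # made-up candidate sequence
    print(longest_identical_stretch(arm, genome))
    print(meets_minimum_identity(arm, genome))
```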


The term "flanking" is intended to indicate that the DNA
sequence A or C is homologous with the genomic sequence
located up to, but not extending into, the DNA sequence B.
The term "overlapping" is intended to indicate that the DNA
sequence A or C is homologous to a part of the genomic
sequence which is constituted by one of the ends of the DNA
sequence B and the sequence immediately outside this
sequence.

The term "located distal for C" as used about the DNA
sequence D is intended to be understood in its conventional
meaning, i.e. that the DNA sequence D is homologous with a
genomic sequence located on the side of the genomic sequence
homologous to the DNA sequence C which is opposite to the
position of the DNA sequence B to be amplified. The distance
between the genomic sequences homologous with the DNA
sequences C and D may vary from the situation where C and D
are identical or partly overlapping to a separation of
several thousand base pairs. However, the DNA sequences
between C and D will eventually become deleted from the
genome when the method of the invention is carried out.
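
The loss of the sequences between C and D can be illustrated
with a small Python sketch. It is a simplified picture, not
the disclosed procedure: the genome and construct are assumed
to be lists of segment labels, the function name and the
labels x1 and x2 (standing for genomic DNA between C and D)
are hypothetical, and the case where C and D are identical or
overlapping is not handled.

```python
def double_crossover(genome, construct):
    """Replace the genomic region spanning the two outer homology
    segments of the construct by the construct itself."""
    left_arm, right_arm = construct[0], construct[-1]
    i = genome.index(left_arm)        # genomic copy of C
    j = genome.index(right_arm, i)    # genomic copy of D, distal to C
    return genome[:i] + construct + genome[j + 1:]


if __name__ == "__main__":
    parent = ["A", "B", "C", "x1", "x2", "D"]
    construct = ["C", "M", "A", "D"]
    result = double_crossover(parent, construct)
    print("-".join(result))
```

In the result, x1 and x2 are gone and the structure
A-B-C-M-A-D has been formed, in line with the statement that
the sequences between C and D are deleted.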

In another aspect the present invention relates to a cell
comprising multiple copies of a DNA sequence comprising the
structure M-B in its genome, in which M denotes a DNA
sequence encoding a selection marker and B denotes a DNA
sequence encoding a desirable polypeptide, the multiple
copies of the structure M-B being located between two
directly repeated sequences.

In further aspects the invention relates to a DNA construct
comprising the structure C-M-A-D and intended for use in the
amplification of a genomic DNA sequence B, in which A, C, M
and D have the meanings indicated above, as well as a vector
harbouring the DNA construct.

Finally, the invention relates to a process for producing a
polypeptide encoded by a DNA sequence B, comprising culturing
a cell as defined above having integrated multiple copies of
the DNA sequence B under conditions conducive to the
production of the polypeptide and recovering the resulting
polypeptide from the culture.

DETAILED DESCRIPTION OF THE INVENTION 

The integration step a) of the method of the invention may be
accomplished by any suitable method, the nature of which
depends on the organism and DNA construct in question. First,
the DNA construct must be introduced into the cell. The DNA
construct may be introduced as such using techniques known in
the art for direct introduction of DNA, e.g. by use of
electroporation, transformation of competent cells,
protoplast transformation, or ballistic transformation, but
is suitably carried on a vector capable of affording
integration of the DNA construct into a genome of the cell.

The vector is advantageously a plasmid or a bacteriophage
which may be introduced into the parent cell by any technique
suited for the vector and parent cell in question, including
transformation as above, protoplast fusion, transfection,
transduction and conjugation.

Upon introduction into the parent cell, the DNA construct,
optionally in combination with vector-derived DNA, is
integrated into a genome by homologous recombination which
takes place between the homologous sequences. The appended
Fig. 1 illustrates how a double recombination event between a
genome and the DNA construct can give rise to a cell
according to the invention, containing the structure
A-B-C-M-A-D in its genome.

When a vector is used for the integration of the DNA
construct, a selection for cells having received the vector
may be performed prior to the selection step b) of the method
of the invention, thereby improving the efficiency with which
the integration of the DNA construct takes place. For this
purpose, a vector which is able to replicate under certain
(permissive) conditions and unable to replicate under other
(non-permissive) conditions may be used. The vector may, for
instance, be one which is temperature-sensitive for
replication. Thus, the vector may be one which is unable to
replicate at increased temperatures which yet permit growth
of the parent cell. The cells are initially cultured at a
temperature permitting plasmid replication and subsequently,
after integration into the bacterial genome may have taken
place, cultured at a temperature which does not permit
plasmid replication, so that the vector is lost from the
cells unless integrated into the genome.

The vector may further comprise a selectable marker. In this
case, the cultivation at the non-permissive temperature may
be conducted under selective conditions to ensure that only
cells containing the integrated vector, which includes the
DNA construct and the selectable marker, will survive.

The selectable marker may be any marker known in the art, for
instance a gene coding for a product which confers antibiotic
resistance to the cell, which confers prototrophy to an
auxotrophic strain, or which complements a defect of the host
(e.g. dal genes introduced in a dal⁻ strain; cf. B.
Diderichsen (1986)). Cells surviving under these conditions
will be cells containing the vector or cells in which the
vector comprising the DNA construct of the invention has been
integrated. The selectable marker may, e.g., be excised from
a known source or present on a vector, e.g. a plasmid, used
for the construction of the DNA construct to be used in the
method of the invention.

In the selection step b) of the method of the invention,
selection for cells comprising the structure A-B-C-M-A-D, in
any orientation, is made. Such cells could be the result of
a single recombination event, in which case the vector is
still present in the genome, or could advantageously be the
result of a double recombination event, in which case the
vector is not present in the genome. The double recombination
event can be the result of two sequential single
recombination events, the first consisting of an integration
into the genome of the vector containing the structure
C-M-A-D, the second consisting of excision of the vector from
the genome. The process is illustrated schematically in Fig.
2, from which it is apparent that integration via fragment C
followed by excision via fragment D, or vice versa, will give
a genome containing the structure A-B-C-M-A-D of the
invention.

This selection may be accomplished by growing the cells under
selection pressure for the selection marker encoded by the
DNA sequence M and analysing the thereby selected cells for
the presence of the structure A-B-C-M-A-D, e.g. by use of
conventional DNA analysis techniques, including restriction
enzyme digestion and gel analysis combined with Southern
blotting, or by use of PCR using suitable primers
corresponding to characteristic parts of the structure
A-B-C-M-A-D.

In one particular embodiment of the invention, a
temperature-sensitive vector is used, which in addition to
the structure C-M-A-D carries another selectable marker, Y.
The vector is introduced into the parent cell at a permissive
temperature, selecting for either M or Y or both. Propagation
is then continued at a non-permissive temperature, and
selection for either M or Y or both is maintained. Cells
growing under these conditions will have the vector
integrated into a genome (by any of the three fragments C, A
or D). Subsequently, cells are grown at a permissive
temperature in the absence of selection pressure. This will
allow plasmid replication, excision of the integrated plasmid
from the genome (again by any of the three fragments C, A or
D), and eventually loss of plasmid from the cells. Cells
which still contain the selection marker M are now selected,
and such cells are screened for the presence of the selection
marker Y, e.g. by replica plating. Cells which contain M but
lack Y can only arise by integration via fragment C followed
by excision via fragment D, or vice versa, and contain the
structure A-B-C-M-A-D in a genome.
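
The reasoning behind this screen can be checked by enumerating
the nine combinations of integration and excision fragment.
The Python sketch below is a simplified model, not the
experimental procedure: it assumes that the genome is a linear
list of segment labels, that the vector is a circle carrying
C-M-A-D followed by the vector backbone with marker Y, that
integration is a single crossover at one homologous fragment,
and that excision is a single crossover between the two
resulting copies of a fragment. The label X (genomic DNA
between C and D) and all function names are hypothetical.

```python
from itertools import product

GENOME = ["A", "B", "C", "X", "D"]    # parent genome; X lies between C and D
PLASMID = ["C", "M", "A", "D", "Y"]   # circular vector; Y marks the backbone


def integrate(genome, plasmid, site):
    """Single crossover at `site`: the whole circle is inserted,
    leaving a copy of `site` on either side of the inserted DNA."""
    g = genome.index(site)
    p = plasmid.index(site)
    rest = plasmid[p + 1:] + plasmid[:p]   # circle opened at `site`
    return genome[:g + 1] + rest + [site] + genome[g + 1:]


def excise(genome, site):
    """Single crossover between the two copies of `site`: one copy
    and everything between the copies are lost from the genome."""
    first = genome.index(site)
    second = genome.index(site, first + 1)
    return genome[:first] + genome[second:]


def outcomes():
    """Final genome for each (integration site, excision site) pair."""
    return {
        (s_in, s_out): excise(integrate(GENOME, PLASMID, s_in), s_out)
        for s_in, s_out in product("CAD", repeat=2)
    }


def keeps_m_without_y(genome):
    """Phenotype sought in the screen: M present, Y absent."""
    return "M" in genome and "Y" not in genome


if __name__ == "__main__":
    for (s_in, s_out), genome in outcomes().items():
        flag = "  <- M present, Y absent" if keeps_m_without_y(genome) else ""
        print(f"in via {s_in}, out via {s_out}: {'-'.join(genome)}{flag}")
```

Under these assumptions only the combinations C then D and D
then C give a genome that retains M and lacks Y, and in both
cases the genome has the structure A-B-C-M-A-D.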

The DNA sequence M present in the DNA construct to be
integrated by the present method may encode any selectable
marker, e.g. of any type as described above in connection
with the marker optionally carried by the vector to be used
in the method of the invention. Thus, the DNA sequence M may
encode an antibiotic resistance such as resistance to
kanamycin, tetracycline, ampicillin, erythromycin or
chloramphenicol, or a resistance to various heavy metals such
as selenate, antimony or arsenate.

It will be understood that the increased number of
genomically integrated copies of the DNA sequences B and M
obtained in the propagation step c) of the method of the
invention is the result of successive recombination events
between, initially, the two copies of the DNA sequence A
(directly repeated) surrounding the DNA sequences B and M. It
may be possible to control the amplification of the DNA
sequence B, and thus arrive at a predetermined number of
copies, through the choice of the selectable marker and the
strength of the selection pressure used in the propagation
step c). There is no theoretical upper limit for the number
of copies of the DNA sequences B and M to be obtained in this
step, but in practice the number of copies will be limited by
the burden put on the host cell.

It should be noted that once the DNA construct has been
integrated in a genome of the parent cell, the cell may be
cultured in the absence of selection pressure without
substantial loss of the DNA construct or parts thereof from
the cell. This is believed to be ascribable to the fact that
the integrated DNA is incapable of autonomous replication,
and is replicated together with the host genome in which it
is integrated.

It will be understood that the novel method of the invention
is generally applicable for the amplification of a DNA
sequence present in a genome irrespective of the type of cell
or genome. The only restriction as to the nature of the cell
is that the cell is one which may be transformed or which may
otherwise allow for introduction of foreign DNA. The cell may
comprise one or more genomes, e.g. in the form of plasmids or
chromosomes.

For instance, the parent cell may be a microbial cell, an
insect cell, a plant cell, a vertebrate cell, or a mammalian
cell. When the parent cell is a microbial cell it may be a
prokaryotic or a eukaryotic cell such as a bacterial or
fungal (including yeast) cell.

When the cell is a bacterial cell it may be a cell of a
gram-positive bacterium such as Bacillus or Streptomyces,
e.g. a cell of Bacillus subtilis, Bacillus licheniformis,
Bacillus lentus, Bacillus brevis, Bacillus
stearothermophilus, Bacillus alkalophilus, Bacillus
amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus, Bacillus thuringiensis, Streptomyces
lividans or Streptomyces murinus, or a cell of a
gram-negative bacterium such as Escherichia and Pseudomonas.
Other examples of bacterial cells include cells of
archaebacteria such as Pyrococcus.

When the cell is a fungal cell it may be a yeast cell such as
a cell of Saccharomyces or Schizosaccharomyces, or a cell of
a filamentous fungus such as a cell of Aspergillus, e.g. A.
niger, A. nidulans or A. oryzae.

The DNA sequence B to be amplified may be native to the
parent cell or may alternatively be one which is not native
to the parent but which has been cloned from another organism
(e.g. of the type described above) or which has been
synthesized and subsequently introduced into the host
chromosome or another host-carried genome by any convenient
process, e.g. crossing-over, prior to the integration of the
DNA construct of the invention. The DNA sequence B may be
introduced in its entirety or may be assembled in the host
genome in question, e.g. by successive introduction of
constituent sequences of sequence B. This latter approach is
of particular use when the DNA sequence B is unclonable in
its entirety.

The DNA sequence B may be one having or encoding any
function. For instance, the DNA sequence B may comprise an
open reading frame, e.g. encoding a structural or regulatory
protein or polypeptide, and may be a single gene, a cluster
of genes or an operon. The DNA sequence B may further
comprise one or more regulatory signals involved in the
expression from the open reading frame, such as transcription
or translation termination or initiation sequences.

Preferably, the DNA sequence B comprises an expressible gene 
which may contain all necessary regulatory sequences such as 
a promoter, a terminator, a ribosome binding site, etc. 


Normally, the DNA sequence B to be amplified is one encoding
a desirable product such as an enzyme, a hormone, an
antigenic component, an immunoactive protein or peptide, a
growth factor, an allergen, a tumor associated antigen, a
blood protein, and the like, in other words any kind of
industrially useful product, the production of which is
desirable.

Examples of enzymes of interest include amylolytic, lipolytic
and proteolytic enzymes, transferases, isomerases,
peroxidases, oxidases etc. In particular it is preferred that
the DNA sequence B encodes a protease, a lipase, an amylase,
a galactosidase, a pullulanase, a cellulase, a glucose
isomerase, a protein disulphide isomerase, a CGTase
(cyclodextrin glucanotransferase), a glucose oxidase, a
glucosyl transferase, or a xylanase.

Examples of other useful products include insulin-like growth
factors, insulin, human or bovine growth hormone, human blood
coagulation factors, interleukin, tPA etc.

Alternatively, the DNA sequence B may comprise one or more
genes encoding a biosynthetic pathway, one or more genes
encoding elements of the cell's transcription, translation or
protein secretion apparatus (for instance sigma factors or
sec genes of procaryotic cells), a regulatory factor acting
in the cell or a metal resistance, or the DNA sequence B may
complement an auxotrophic mutation of the parent cell.

From the above disclosure it will be understood that the DNA
sequences A and C may be homologous with any genomic sequence
overlapping or flanking the DNA sequence B. When the DNA
sequence B is a gene, the DNA sequence A or C may
advantageously be homologous to a full or partial promoter
sequence upstream of the coding part of the DNA sequence B.
An example of such a construct is shown in Example 1
hereinafter.

The DNA construct used in the method of the invention may be
synthesized through a series of genetic manipulations
employing methods and enzymes known in the art. Typically,
each of the genomic sequences with which the DNA sequences A,
C and D are to be homologous is identified by conventional
DNA analysis methods.

For instance, a cDNA or genomic library may be prepared from
the organism in question and the DNA sequence B to be
amplified identified therein. When at least a part of the DNA
sequence B is known, the DNA sequence B may be identified by
screening for positive clones by conventional hybridization
procedures, e.g. using oligonucleotide probes synthesized on
the basis of the known part of the DNA sequence B in
accordance with standard techniques (cf. Sambrook et al.,
1989), or more preferably, by use of polymerase chain
reaction (PCR) using degenerate oligonucleotide probes
prepared on the basis of the known part of the DNA sequence
B. For instance, the PCR may be carried out using the
techniques described in US Patent No. 4,683,202 or by R.K.
Saiki et al. (1988).

When the nucleotide sequence of the DNA sequence B is
unknown, and an expression product thereof is known, one may
screen cDNA or genomic clones for an activity of the product
and thereby identify a clone from which the activity is
expressed. Subsequently, part of the DNA of the clone is
isolated and sequenced and the location of the DNA sequence B
or part thereof is identified.

The DNA sequence B to be amplified may be identified by way
of mutation, e.g. by transposon insertions that destroy the
cell's ability to produce the product of B, and parts of the
DNA sequence of B may be determined e.g. by inverse PCR using
primers corresponding to the transposon sequences. In this
way, the DNA sequences comprising the ends of B and flanking
regions may be determined, even if B may not be clonable
either partly or in its entirety.

In order to be able to prepare the DNA sequences A, C and D,
at least the 5' and 3' ends of B (including at least
sufficient sequence data to allow specific binding of a probe
or PCR primer, e.g. 12 nucleotides) should be known. Once the
sequences of these ends have been identified, the DNA
flanking or overlapping both ends of the DNA sequence B may
be identified, e.g. by hybridization or PCR analysis, and
subsequently sequenced. On the basis of these sequences the
DNA sequences A, C and D are prepared.

The DNA sequences A, C, D and M may be prepared synthetically
or may be of cDNA or genomic origin, e.g. isolated from a
genomic or cDNA library by use of the above described
methods.

Alternatively, the DNA sequence of the DNA construct of the
invention may be prepared synthetically by established
standard methods, e.g. the phosphoramidite method described
by Beaucage and Caruthers (1981), or the method described by
Matthes et al. (1984). According to the phosphoramidite
method, oligonucleotides are synthesized, e.g. in an
automatic DNA synthesizer, purified, annealed, ligated and
cloned in appropriate vectors.

Finally, the DNA construct may be of mixed genomic and
synthetic, mixed synthetic and cDNA or mixed genomic and cDNA
origin, prepared by ligating fragments of synthetic, genomic
or cDNA origin (as appropriate), the fragments corresponding
to various parts of the entire recombinant DNA molecule, in
accordance with standard techniques.

As indicated above, the DNA sequence B is advantageously one
which codes for a polypeptide of interest, and the present
invention consequently further relates to a process for
producing a polypeptide of interest, comprising culturing a
cell according to the invention containing multiple copies of
a DNA sequence comprising the structure M-B in a genome, in
which B encodes the polypeptide of interest, under conditions
conducive to the production of the polypeptide and recovering
the resulting polypeptide from the culture. The polypeptide
produced by the present process may be any of the products
listed above such as an enzyme, e.g. a protease, amylase or
lipase.

The present invention is further illustrated in the appended 
drawing in which 



WO 94/14968    PCT/DK93/00438



Fig. 1 illustrates a double recombination event between a
genome and a DNA construct of the invention which results in
a cell containing the structure A-B-C-M-A-D in its genome,

Fig. 2 illustrates a double recombination event which is the
result of two sequential single recombination events, the
first consisting of an integration into the genome of the
vector containing the structure C-M-A-D, the second
consisting of excision of the vector from the genome,
resulting in a genome containing the structure A-B-C-M-A-D of
the invention. Various possibilities of integration and
excision, respectively, are illustrated,

Fig. 3 is a restriction map of plasmid pDN1981,

Fig. 4 is a restriction map of plasmid pSJ1985,

Fig. 5 is a restriction map of plasmid pSJ2024,

Fig. 6 is a restriction map of plasmid pSJ980,

Fig. 7 is a restriction map of plasmid pSJ1926,

Fig. 8 is a restriction map of plasmid pSJ2059, and

Fig. 9 illustrates genomic maps and integration events
referred to in Example 1, of which

A illustrates the amyL gene in the B. licheniformis
chromosome,

B integration via promoter fragment (ooooooo),

C integration via 'amyL fragment (the 5' part of the coding
sequence **********),

D integration via the downstream amyL fragment (downstream of
the coding sequence xxxx),

E excision of plasmid from integrant type C via the homology
downstream of the coding sequence (xxx),

F amplification of the P-amyL-kanB region (initially via the
two promoter regions oooooo).

The invention is further illustrated in the following
examples, which are not, in any manner, intended to limit the
scope of the invention as claimed.

MATERIALS AND METHODS

Strains:

Bacillus licheniformis SJ1904 is the α-amylase producing
strain derived from strain SJ1707 by integration/excision of
plasmid pSJ1755 as described in Example 6 of WO 93/10249, the
contents of which are incorporated herein by reference.

Bacillus subtilis DN1885: an amyE, amyR2, spo+, Pro+
derivative of B. subtilis 168 (Diderichsen et al., 1990).

Media:

TY:       Trypticase              20 g/l
          Yeast extract            5 g/l
          FeCl2·4H2O               6 mg/l
          MnCl2·4H2O               1 mg/l
          MgSO4·7H2O              15 mg/l
          pH 7.3

BPX:      Potato starch          100 g/l
          Barley flour            50 g/l
          BAN 5000 SKB             0.1 g/l
          Sodium caseinate        10 g/l
          Soy bean meal           20 g/l
          Na2HPO4·12H2O            9 g/l
          Pluronic                 0.1 g/l

LB agar:  Bacto-tryptone          10 g/l
          Bacto yeast extract      5 g/l
          NaCl                    10 g/l
          Bacto agar              15 g/l
          Adjusted to pH 7.5 with NaOH

GENERAL METHODS

The experimental techniques used to construct the plasmids
were standard techniques within the field of recombinant DNA
technology, cf. Sambrook et al. (1989).

Restriction endonucleases were purchased from New England
Biolabs and Boehringer Mannheim and used as recommended by
the manufacturers. T4 DNA ligase was purchased from New
England Biolabs and used as recommended by the manufacturer.

Preparation of plasmid DNA from all strains was conducted by
the method described by Kieser, 1984.

Transformation of B. subtilis

Competent cells were prepared and transformed as described by
Yasbin et al., 1975.

Transformation of B. licheniformis

Plasmids were introduced into B. licheniformis by
polyethylene glycol-mediated protoplast transformation as
described by Akamatzu, 1984.

Amylase activity was determined with the Phadebas® Amylase
Test kit from Pharmacia Diagnostics as described by the
supplier.

EXAMPLES 

EXAMPLE 1

Amplification of an amylase coding gene

This example illustrates the amplification of an amylase
coding gene present in the chromosome of the B. licheniformis
strain SJ1904. The strain constructed according to this
example is one which contains in its chromosome, in the
following order:

1) the amylase promoter, 2) the amylase structural gene, 3) a
kanamycin resistance gene, and 4) another copy of the amylase
promoter. The two copies of the amylase promoter in this case
function as the directly repeated DNA sequences A.

Selection for growth at increasing levels of kanamycin is
shown to lead to amplification of the amylase-coding gene
(including promoter) and the kanamycin resistance gene.
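
The arrangement described above can be pictured as a repeating unit bounded by the direct repeats. The following is a minimal sketch, assuming that amplification yields tandem copies of the unit lying between the two promoter copies; the segment labels and the function name are illustrative assumptions, and the sketch models only the resulting order of segments, not the recombination mechanism.

```python
def amplified_locus(copies):
    """Return the segment order of a locus carrying `copies` tandem units.

    Each unit is: promoter (repeat A), amylase gene (B), kanamycin marker (M).
    The locus is closed by one further copy of the promoter (repeat A).
    """
    if copies < 1:
        raise ValueError("at least one copy is required")
    unit = ["P_amyL", "amyL", "kan"]
    return unit * copies + ["P_amyL"]


# Hypothetical usage: the unamplified strain and a three-copy derivative.
single = amplified_locus(1)
locus = amplified_locus(3)
gene_copies = locus.count("amyL")
print("-".join(single))
print("-".join(locus))
```

Under this model the number of gene copies and marker copies always rise together, which is why selection on the marker can be used to enrich for cells with more copies of the gene.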

Plasmid constructions

All plasmids were constructed in B. subtilis DN1885,
selecting for kanamycin resistance (10 µg/ml).

pDN1981 (fig. 3) contains the B. licheniformis α-amylase
(amyL) gene and has been described by Jørgensen et al., 1990.

pSJ1985 (fig. 4) contains the amyL promoter (PamyL), followed
by a 210 bp fragment which originally was situated immediately
downstream of the amyL terminator sequence. The fragment
was PCR amplified from pDN1981 with primers LWN3226 + LWN3223
(Table 1), digested with NdeI and HindIII, and ligated to the
4 kb NdeI-HindIII fragment from pDN1981 to give pSJ1985.

pSJ2024 (fig. 5) contains this combination of promoter and
downstream fragment on a temperature-sensitive plasmid based
on pE194 (Horinouchi and Weisblum, 1982b). It was constructed
by ligation of the 1.7 kb BglII-HindIII fragment from pSJ1985
to the 4.9 kb BglII-HindIII fragment of pSJ980 (fig. 6).
pSJ980 is described in WO 93/10249.

pSJ1926 (fig. 7) contains the amyL gene including its
terminator sequence, but the sequences downstream of the
terminator have been deleted (the downstream 210 bp fragment
contained on pSJ1985 is thus not present on pSJ1926). A 0.5 kb
fragment from pDN1981 was PCR amplified with primers LWN3224
+ LWN3227 (Table 1), digested with SalI and HindIII, and
ligated to the 5.2 kb SalI-HindIII fragment from pDN1981,
giving pSJ1926. The SalI-HindIII fragment of pSJ1926 derived
by PCR amplification has been DNA sequenced and contains no
PCR induced mutations.

pSJ2059 (fig. 8) contains a 1 kb fragment of the amyL gene
just including the terminator sequence, a kanamycin
resistance gene, the amyL promoter, and finally the fragment
downstream from the amyL terminator, all on a temperature-
sensitive origin. pSJ1926 was digested with EcoRI and KpnI,
and the 1 kb fragment inserted between EcoRI and KpnI in
pSJ2024, to give pSJ2059.


Table 1

List of primers

            <EcoRI>
LWN3223: 5'-GAA TTC TCA TGT TTG ACA GC-3' (SEQ ID #1)
         pos. 1-20 in pDN1981v2 sequence

                 <-XbaI-><-NdeI->
LWN3226: 5'-GAC TTC TAG ACA TAT GTA AAT TTC GTT GAT TAC ATT-3' (SEQ ID #2)
         pos. 2221-2240 in amyLv2 sequence

                         <HindIII>
LWN3227: 5'-GAC TGT CCA GAA GCT TAA AAT AAA AAA ACG GAT TTC-3' (SEQ ID #3)
         pos. 2210-2190 in amyLv2 sequence

LWN3224: 5'-ATG ATA CAC AGC CGG GGC AA-3' (SEQ ID #4)
         pos. 1690-1710 in amyLv2 sequence

LWN3554: 5'-GTT GAC CAG ACA TTA CG-3' (SEQ ID #5)
         pos. 1217-1201 in kanB sequence

                     <-NheI->
LWN3208: 5'-TGA GTC AGC TAG CAA CTG TCA TGA AAC AAC AAA AAC GGC TTT ACG CC
         -3' (SEQ ID #6)
         pos. 622-650 in amyLv2 sequence.
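
The restriction sites annotated above the primers can be checked mechanically. The following is a minimal sketch that takes the primer sequences as printed in the Sequence Listing of this document (SEQ ID NO: 1-6) and searches them for the standard recognition sequences of the enzymes named in Table 1; the dictionary and function names are illustrative assumptions.

```python
# Primer sequences as printed in the Sequence Listing (SEQ ID NO: 1-6).
PRIMERS = {
    "LWN3223": "GAATTCTCATGTTTGACAGC",
    "LWN3226": "GACTTCTAGACATATGTAAATTTCGTTGATTACATT",
    "LWN3227": "GACTGTCCAGAAGCTTAAAATAAAAAAACGGATTTC",
    "LWN3224": "ATGATACACAGCCGGGGCAA",
    "LWN3554": "GTTGACCAGACATTACG",
    "LWN3208": "TGAGTCAGCTAGCAACTGTCATGAAACAACAAAAACGGCTTTACGCC",
}

# Standard recognition sequences of the enzymes named in Table 1.
SITES = {
    "EcoRI": "GAATTC",
    "XbaI": "TCTAGA",
    "NdeI": "CATATG",
    "HindIII": "AAGCTT",
    "NheI": "GCTAGC",
}


def find_sites(sequence):
    """Map each enzyme whose site occurs in `sequence` to its 0-based start."""
    return {
        enzyme: sequence.find(site)
        for enzyme, site in SITES.items()
        if site in sequence
    }


site_map = {name: find_sites(seq) for name, seq in PRIMERS.items()}
lengths = {name: len(seq) for name, seq in PRIMERS.items()}
for name in PRIMERS:
    print(name, lengths[name], site_map[name])
```

The sites found agree with the annotations in Table 1, and the lengths agree with the LENGTH fields of the Sequence Listing.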
Transformation of B. licheniformis

pSJ2059 was introduced into B. licheniformis SJ1904 by
protoplast transformation, selecting for erythromycin resistance
(5 µg/ml). One transformant thus obtained was strain SJ2127.

Integration

SJ2127 was streaked on LB plates with 10 µg/ml kanamycin and
incubated at 50°C. As pSJ2059 is temperature-sensitive for
replication, only cells containing a chromosomally integrated
copy of the plasmid will give rise to colonies.

pSJ2059 contains three different regions of homology to the
chromosomal amyL region in SJ1904, and integration is
possible by recombination at any of these three regions. This
would give strains in which the chromosome would look as
indicated in fig. 9, B, C or D.

A plasmid integrated as in fig. 9B would not be able to
excise so as to give the wanted strain.

A plasmid integrated as in fig. 9C could give the wanted
strain if excision took place by recombination at the
downstream fragment.

A plasmid integrated as in fig. 9D could give the wanted
strain if excision took place by recombination at the amyL
structural gene fragment.

8 colonies from the 50°C plate were checked by PCR
amplification, using primers LWN3208 + LWN3554 (Table 1). The
reactions were performed directly on material obtained by
resuspending and boiling the cells in TY medium. The position
of the primers is indicated on fig. 9, B, D and E.

No PCR amplified fragment should be obtained from a B-type
integrant, whereas a C-type should give a 2.7 kb fragment,
and a D-type a 7.5 kb fragment.
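
The diagnostic rule stated above can be written as a small lookup. The following is a minimal sketch, assuming that band sizes are read from a gel with some uncertainty; the tolerance value and the function name are illustrative assumptions and do not come from the original disclosure.

```python
# Expected PCR product sizes (kb) for primers LWN3208 + LWN3554, as stated above.
EXPECTED_KB = {"C": 2.7, "D": 7.5}


def classify_integrant(band_kb, tolerance_kb=0.3):
    """Classify an integrant from its PCR band size in kb.

    band_kb is None when no product is observed (expected for B-type).
    tolerance_kb is an assumed allowance for gel sizing error.
    """
    if band_kb is None:
        return "B"
    for integrant_type, expected in EXPECTED_KB.items():
        if abs(band_kb - expected) <= tolerance_kb:
            return integrant_type
    return "unclassified"


# Hypothetical usage with made-up band readings.
calls = [classify_integrant(b) for b in (None, 2.7, 7.4, 5.0)]
print(calls)
```

A missing band is only weak evidence for a B-type integrant, since a failed reaction gives the same result; a positive control reaction would be needed to distinguish the two.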
From 5 of the 8 colonies, a 2.7 kb fragment was observed,
indicating that the integration in these cases had taken
place via the amyL structural gene fragment, giving the C-
type of integrants. These were then propagated in TY medium
at 30°C, to allow excision and loss of the plasmid. Following
three transfers in TY medium, KanaR ErmS colonies were found
by replica plating. The erythromycin sensitivity indicates
loss of the plasmid. The 2.7 kb fragment could still be
produced by PCR amplification from these colonies, as expected
if excision had taken place by recombination at the
downstream fragment, giving the result shown in fig. 9E. One
strain obtained from each of the 5 individual 50°C colonies
was kept, as SJ2147-2151.

Amplification

The α-amylase (amyL) + kanamycin resistance genes in strains
SJ2148 and SJ2150 were amplified by the following procedure:

The strains were inoculated in 10 ml TY medium + 10 µg/ml
kanamycin and shaken at 37°C overnight. New 10 ml cultures
containing 20, 50, 100, and 200 µg/ml kanamycin were
inoculated with 100 µl of the 10 µg/ml culture, and shaken at 37°C
overnight. 10 ml cultures containing 500, 1000, 1500, 2000,
and 2500 µg/ml kanamycin were inoculated with 100 µl of the
200 µg/ml culture.

The 2000 and the 2500 µg/ml cultures were incubated for 4
days, the others harvested after overnight growth. Aliquots
of all cultures were frozen in 15 % glycerol, and cells
harvested for preparation of chromosomal DNA.
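
The stepwise selection above forms a small tree of cultures, each inoculated from a culture grown at a lower kanamycin concentration. The following is a minimal sketch of that schedule; the data structure and function name are illustrative assumptions, while the concentrations and volumes are those stated in the procedure.

```python
# Kanamycin concentration (µg/ml) of each culture mapped to the concentration
# of the culture it was inoculated from (None for the starting culture).
INOCULATED_FROM = {
    10: None,
    20: 10, 50: 10, 100: 10, 200: 10,
    500: 200, 1000: 200, 1500: 200, 2000: 200, 2500: 200,
}

INOCULUM_UL = 100   # volume transferred at each step
CULTURE_ML = 10     # volume of each new culture


def lineage(concentration):
    """Return the series of kanamycin concentrations leading to a culture."""
    path = [concentration]
    while INOCULATED_FROM[path[-1]] is not None:
        path.append(INOCULATED_FROM[path[-1]])
    return path[::-1]


# Fold dilution of the inoculum at each transfer (100 µl into 10 ml).
dilution = (CULTURE_ML * 1000 + INOCULUM_UL) / INOCULUM_UL

print(lineage(2500), dilution)
```

Every culture at 500 µg/ml or above has therefore passed through exactly two earlier selection steps, at 10 and 200 µg/ml.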
Strains isolated

Kanamycin
concentration     Mother strain:
µg/ml             SJ2148      SJ2150

   10             SJ2172      SJ2182
   20             SJ2173      SJ2183
   50             SJ2174      SJ2184
  100             SJ2175      SJ2185
  200             SJ2176      SJ2186
  500             SJ2177      SJ2187
 1000             SJ2178      SJ2188
 1500             SJ2179      SJ2189
 2000             SJ2180      SJ2190
 2500             SJ2181      SJ2191

Chromosomal DNA from the above strains was digested with
BglII, which should give a 4.1 kb fragment derived from the
amplified DNA (see fig. 9F). This fragment is visible in an
EtBr-stained gel even in the strains selected at 10 µg/ml
kanamycin, becomes increasingly conspicuous at 20 and 50
µg/ml, and stays at the high level in the rest of the
strains.

Yield effect of amplification

Shake flasks with BPX medium were inoculated directly from
the glycerol-frozen cultures, and shaken at 300 rpm at 37°C.

The α-amylase yields obtained with the amplified strains were
compared to the yield obtained with strain SJ1904.
          Kanamycin       Experiment A     Experiment B
          in shake        7 days           4 days        [illegible] days
Strain    flask µg/ml     Rel. Yield       Rel. Yield    Rel. Yield

SJ2172     10 / 0         2.76 / 2.6
SJ2173     20 / 0         3.44 / 3.04      1.92          2.72
SJ2174     50 / 0         2.68 / 2.72
SJ2175    100 / 0         3.24 / 2.84      1.84          2.88
SJ2176    200 / 0         3.2 / 3.24       1.84          2.84
SJ2177    500 / 0         0.72 / 3.24      1.76          2.76
SJ2182     10 / 0         0.48 / 2.0
SJ2183     20 / 0         [illegible] / 3.68   2.04      2.6
SJ2184     50 / 0         2.96 / 2.8
SJ2185    100 / 0         2.8 / 3.2        1.44          2.32
SJ2186    200 / 0         2.96 / 2.92
SJ2187    500 / 0         0.6 / 3.56       1.68          2.6
SJ1904      0             1.00             0.6           1.00

It is apparent that the amplified strains all produce more α-
amylase than does the parent strain.


EXAMPLE 2 

Amplification of a CGTase coding gene

This example illustrates the amplification of a gene coding
for a cyclodextrin glycosyltransferase (cgtA). The gene was
originally cloned from a Thermoanaerobacter sp. and inserted
in one copy into the chromosome of a Bacillus licheniformis
strain, replacing the endogenous alpha-amylase gene (amyL) of
that strain. The CGTase gene was combined with an efficient
mutant version of the alpha-amylase promoter and the alpha-
amylase signal peptide on the plasmid used in this process,
and transformation of B. licheniformis with the recombinant
construct was only successful when a spontaneous recombination
event transferred the amyL-cgtA gene to the chromosome under
control of the wild-type amyL promoter. A later recombination
step was then used to introduce the mutant promoter in front
of the chromosomal amyL-cgtA gene. This work, resulting among
others in strain SJ1707 used in the present example, has been
described in WO 93/10249 and WO 93/10248.

As the inventor was unable to obtain transformants of B.
licheniformis with plasmids containing the amyL-cgtA gene
expressed from the mutant promoter, amplification of this
expression cassette in the chromosome was not possible by
methods which required the introduction of the entire
cassette in one step, but was possible by the method described
in the present invention.

The strain constructed according to this example is one
which contains in its chromosome, in the following order:

1) the mutant amyL promoter, 2) the amyL-cgtA gene, 3) a
kanamycin resistance gene, and 4) another copy of the mutant
amyL promoter. The two copies of the amyL promoter in this
case function as the directly repeated DNA sequences A.

Selection for growth at increasing levels of kanamycin is
shown to lead to amplification of the amyL-cgtA gene,
including the mutant promoter, and the kanamycin resistance gene.

The chromosome of SJ1707 contains a fragment of the amyL gene
distal to the amyL-cgtA construct (see WO 93/10249). Plasmid
pSJ2059 could therefore be used as a tool to construct an
amplifiable derivative of strain SJ1707 in the same manner as
it was used in example 1 for amplification of the amylase
gene of B. licheniformis.

Transformation:

pSJ2059 was introduced into B. licheniformis SJ1707 by
protoplast transformation, selecting for erythromycin resistance
(5 µg/ml) at 30°C.

One transformant obtained was kept as SJ2285.

Integration:

SJ2285 was streaked on LB plates with 10 µg/ml kanamycin and
incubated at 50°C overnight.

10 colonies formed at 50°C were propagated in TY medium at
30°C to allow excision and loss of the integrated plasmid.

Following one transfer in TY medium, KanaR ErmS colonies were
found by replica plating of the cultures derived from 7 of
the 10 integrant colonies.

Amplification (as in example 1) was attempted with 4 of these
strains, and isolates eventually growing in 2000 µg/ml
kanamycin were obtained from 3 of these 4.

One amplified series was kept:

Kanamycin
concentration
µg/ml             Strain

   10             SJ2322
   20             SJ2323
   50             SJ2324
  100             SJ2325
  200             SJ2326
  500             SJ2327
 1000             SJ2328
 1500             SJ2329
 2000             SJ2330

SJ2323-SJ2326 were inoculated from SJ2322 (100 µl in 10 ml),
and SJ2327-SJ2330 were inoculated from SJ2326.
Aliquots of all cultures were frozen in 15 % glycerol, and
cells harvested for preparation of chromosomal DNA.

Southern analysis of chromosomal DNA from strains SJ2324 and
SJ2328 digested by EcoRI revealed a 5.5 kb fragment as
expected from the amplification of the amyL-cgtA + kanamycin
resistance genes. The 5.5 kb fragment from strain SJ2328 was
very conspicuous already in the EtBr-stained agarose gel.

Yield effect of amplification:

Shake flasks with BPX medium were inoculated directly from
the glycerol-frozen cultures, and shaken at 300 rpm at 37°C.
The CGTase yields obtained with the amplified strains were
compared to the yield obtained with strain SJ1707.

          Kanamycin     exp. A        exp. B        exp. C
          µg/ml         7 days        7 days        8 days
Strain                  Rel. Yield    Rel. Yield    Rel. Yield

SJ1707       0                        1.0           1.1

SJ2322       0          2.1           1.4
            10          2.2           1.7

SJ2323       0          1.9
            20          2.0

SJ2324       0          2.2           1.5           1.5
            50          2.4           1.7           1.8

SJ2325       0          1.8
           100          1.6

SJ2326       0          1.6
           200          1.8

SJ2327       0          1.9
           500          1.3

SJ2328       0          1.9           1.6           1.4
          1000          2.4           2.0           2.1

It is apparent that the amplified strains produce more CGTase
than does the parent strain.

The stability of some of the strains from shake flasks
without kanamycin was checked by platings on LB plates containing
starch and scoring for halo formation. The ultimate effect of
genetic instability would be loss of even the last copy of
the CGTase gene, resulting in CGTase negative segregants
which would be unable to produce halos on starch plates.

SJ2322: 100/100 positive (exp. A)
        300/300 positive (exp. B)

SJ2324: 100/100 positive (exp. A)
        300/300 positive (exp. B)
        120/120 positive (exp. C)

SJ2328: 200/200 positive (exp. A)
        500/500 positive (exp. B)
        120/120 positive (exp. C)

None of the strains investigated lost the last copy of the
CGTase gene under the conditions tested.
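
Observing no negative colonies among N scored colonies bounds the segregant frequency without showing it to be zero. The following is a minimal sketch of that bound, assuming that colonies are independent samples and that counts from the separate experiments may be pooled per strain; this statistical treatment and the names used are illustrative assumptions and are not part of the original disclosure.

```python
# Colonies scored per strain (all halo-positive), from the counts listed above.
COLONIES_SCORED = {
    "SJ2322": [100, 300],
    "SJ2324": [100, 300, 120],
    "SJ2328": [200, 500, 120],
}


def upper_bound_zero_events(n, confidence=0.95):
    """One-sided upper bound on an event frequency when 0 of n trials show it.

    Solves (1 - p) ** n = 1 - confidence for p.
    """
    return 1 - (1 - confidence) ** (1 / n)


totals = {strain: sum(counts) for strain, counts in COLONIES_SCORED.items()}
bounds = {strain: upper_bound_zero_events(n) for strain, n in totals.items()}
for strain in totals:
    print(f"{strain}: 0/{totals[strain]} negative, "
          f"95% upper bound {bounds[strain]:.4f}")
```

On these assumptions, the pooled counts are consistent with a frequency of CGTase-negative segregants below roughly one percent for each strain tested.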

SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANT:
        (A) NAME: NOVO NORDISK A/S
        (B) STREET: Novo Alle
        (C) CITY: Bagsvaerd
        (E) COUNTRY: DENMARK
        (F) POSTAL CODE (ZIP): DK-2880
        (G) TELEPHONE: +45 44448888
        (H) TELEFAX: +45 4449 3256
        (I) TELEX: 37304

    (ii) TITLE OF INVENTION: DNA amplification
    (iii) NUMBER OF SEQUENCES: 6
    (iv) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1,
            (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 20 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (iii) HYPOTHETICAL: NO
    (iii) ANTI-SENSE: NO
    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GAATTCTCAT GTTTGACAGC

(2) INFORMATION FOR SEQ ID NO: 2:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 36 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (iii) HYPOTHETICAL: NO
    (iii) ANTI-SENSE: NO
    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
GACTTCTAGA CATATGTAAA TTTCGTTGAT TACATT
36

(2) INFORMATION FOR SEQ ID NO: 3:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 36 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (iii) HYPOTHETICAL: NO
    (iii) ANTI-SENSE: NO
    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GACTGTCCAG AAGCTTAAAA TAAAAAAACG GATTTC
36



(2) INFORMATION FOR SEQ ID NO: 4:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 20 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (iii) HYPOTHETICAL: NO
    (iii) ANTI-SENSE: NO
    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
ATGATACACA GCCGGGGCAA
20

(2) INFORMATION FOR SEQ ID NO: 5:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 17 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (iii) HYPOTHETICAL: NO
    (iii) ANTI-SENSE: NO
    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GTTGACCAGA CATTACG
17

(2) INFORMATION FOR SEQ ID NO: 6:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 47 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (iii) HYPOTHETICAL: NO
    (iii) ANTI-SENSE: NO
    (v) FRAGMENT TYPE: internal

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
TGAGTCAGCT AGCAACTGTC ATGAAACAAC AAAAACGGCT TTACGCC
47

CLAIMS

1. A method of amplifying in vivo a DNA sequence B present in
a genome of a parent cell, comprising

a) integrating in a genome of said cell a DNA construct
comprising the structure C-M-A-D, in which

A denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B,

C denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B, the
sequence C being located in the opposite end of the sequence
B as compared to A,

D denotes a DNA sequence which is homologous with a genomic
DNA fragment located distal for C as compared to B, and

M denotes a DNA sequence encoding a selection marker,

b) selecting for cells in which the DNA sequence M has been
integrated in the genome either upstream or downstream of the
DNA sequence B together with the sequence A, which cells
comprise, in any orientation, the structure A-B-C-M-A-D, and

c) propagating the cells selected in step b) under increasing
selection pressure for the selection marker encoded by the
DNA sequence M so as to obtain a cell which has obtained an
increased number of genomically integrated copies of the DNA
sequences B and M as compared to the parent cell.

2. The method according to claim 1, in which the DNA
construct is carried on a suitable vector.

3. The method according to claim 2, in which the vector is a
plasmid or a phage.

4. The method according to claim 2 or 3, in which the vector
is temperature-sensitive for replication.

5. The method according to claim 4, in which the vector
further carries a DNA sequence encoding a selection marker.

6. The method according to any of the preceding claims, in
which the parent cell is a microbial cell, a plant cell, an
insect cell, a vertebrate cell, or a mammalian cell.

7. The method according to claim 6, in which the parent cell
is a bacterial or a fungal cell.

8. The method according to claim 7, in which the bacterial
cell is a cell of a gram-positive bacterium such as Bacillus
or Streptomyces or a cell of a gram-negative bacterium such
as Escherichia, the fungal cell is a yeast cell such as a
cell of Saccharomyces, or a cell of a filamentous fungus such
as a cell of Aspergillus.

9. The method according to any of claims 1-8, in which the
DNA sequence B comprises an open reading frame.

10. The method according to claim 9, in which the DNA
sequence B further comprises one or more regulatory signals.

11. The method according to claim 9 or 10, in which the DNA
sequence B is a single gene, a cluster of genes or an operon.

12. The method according to any of claims 8-11, in which the
DNA sequence B is heterologous to the parent cell and derived
from a microorganism, a plant, an insect, a vertebrate or a
mammal.

13. The method according to claim 12, in which the DNA
sequence B is derived from a bacterium or a fungus.

14. The method according to any of claims 8-13, in which the
DNA sequence B encodes an enzyme, a hormone, an antigenic
component, an immunoactive protein or peptide, a growth
factor, an allergen, a tumor associated antigen, a blood
protein.

15. The method according to any of claims 8-14, in which the
DNA sequence B comprises one or more genes encoding a
biosynthetic pathway, one or more genes encoding elements of
the cell transcription, translation or protein secretion
apparatus, a regulatory factor acting in the cell or a metal
resistance, or B complements an auxotrophic mutation of the
host cell.

16. The method according to claim 1, in which the DNA
sequence B is a gene and the DNA sequence A is homologous to
a full or partial promoter sequence upstream of the coding
part of the DNA sequence B.

17. The method according to any of the preceding claims in
which the DNA sequence M encodes a product which confers
antibiotic resistance to the parent cell, which confers
prototrophy to an auxotrophic cell, or which complements a defect
of the parent cell.

18. The method according to claim 17, in which the antibiotic
resistance is a resistance to kanamycin, tetracyclin,
ampicillin, erythromycin, chloramphenicol.

19. The method according to claim 17, in which the DNA
sequence M encodes a product which confers resistance to a
heavy metal such as selenate, antimony or arsenate.

20. A cell comprising multiple copies of a DNA sequence
comprising the structure M-B in its genome, in which M denotes a
DNA sequence encoding a selection marker and B denotes a DNA
sequence encoding a desirable polypeptide, the multiple
copies of the structure M-B being located between two
directly repeated sequences.

21. A cell according to claim 20 which is of microbial,
plant, insect, vertebrate or mammalian origin.

22. A cell produced by the method according to any of claims
1-19.

23. A vector harbouring a DNA construct comprising the
structure C-M-A-D and intended for use in the amplification of a
genomic DNA sequence B, in which

A denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B,

C denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B, the
sequence C being located in the opposite end of the sequence
B as compared to A,

D denotes a DNA sequence which is homologous with a genomic
DNA fragment located distal for C as compared to B, and

M denotes a DNA sequence encoding a selection marker.

24. The vector according to claim 23, which is a plasmid or a
bacteriophage.

25. The vector according to claim 24 further comprising a
temperature-sensitive origin of replication.

26. A DNA construct comprising the structure C-M-A-D intended
for use in the amplification of a genomic DNA sequence B, in
which

A denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B,

C denotes a DNA sequence which is homologous with a genomic
DNA fragment either flanking or overlapping the DNA sequence
B to be amplified or being a subsequence of the DNA sequence
B constituting one of the ends of said sequence B, the
sequence C being located in the opposite end of the sequence
B as compared to A,

D denotes a DNA sequence which is homologous with a genomic
DNA fragment located distal for C as compared to B, and

M denotes a DNA sequence encoding a selection marker.

27. A process for producing a polypeptide encoded by a DNA
sequence B comprising culturing a cell according to any of
claims 20-22 having integrated multiple copies of the DNA
sequence B under conditions conducive to the production of
the polypeptide and recovering the resulting polypeptide from
the culture.